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Consistency and Random Constraint Satisfaction Models 



In this paper, we study the possibihty of designing non-trivial random CSP models by 
exploiting the intrinsic connection between structures and typical-case hardness. We show 
that constraint consistency, a notion that has been developed to improve the efficiency 
of CSP algorithms, is in fact the key to the design of random CSP models that have 
interesting phase transition behavior and guaranteed exponential resolution complexity 
without putting much restriction on the parameter of constraint tightness or the domain size 
of the problem. We propose a very flexible framework for constructing problem instances 
with interesting behavior and develop a variety of concrete methods to construct specific 
random CSP models that enforce different levels of constraint consistency. 

A series of experimental studies with interesting observations are carried out to illus- 
trate the effectiveness of introducing structural elements in random instances, to verify the 
robustness of our proposal, and to investigate features of some specific models based on 
our framework that are highly related to the behavior of backtracking search algorithms. 

1. Introduction 

The tale of random models of constraint satisfaction problems (CSP) gives us an excellent 
example of the dramatic impact of structures on the typical-case hardness of randomly- 
generated problem instances (Achlioptas et al., 1997; Gent et al., 2001; Maclntyre et al., 
1998; Molloy, 2002; Prosser, 1996; Smith, 2001). It also shows that the study of phase 
transitions of random models of NP-complete problems is indeed a viable approach to 
understanding the interplay between the structures and the efficiency of search algorithms. 

Unlike the random Boolean satisfiability problem (SAT) where structures do not seem 
to exist in randomly-generated instances, algorithmically-exploitable structures do exist 
in the more general random constraint satisfaction problem. The chief reason for this 
is that in random constraint satisfaction instances the uncontrolled existence of multiple 
nogoods within the same constraint generates small scale structures that make the instance 
unsatisfiable. Here small scale may mean only a few variables are involved, or that the 
variables involved are very highly constrained in a local way. The first example of such 
structures is the existence of flawed variables and flawed constraints in random instances 
generated from the four classical random CSP models in most of the parameter regions 
(Achlioptas et al., 1997), assuming that the domain size is fixed. The appearance of flawed 
variables and constraints makes these models trivially unsatisfiable and excludes any phase 
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transition behavior. As yet another example, the existence of embedded easy subproblems 
(Gao &: Culberson, 2003), also called "flowers" (Molloy & Salavatipour, 2003), in another 
part of the parameter space of the models makes randomly-generated instances polynomially 
sovablc by constraint consistency algorithms. 

Several new models have been proposed to overcome the trivial unsatisfiability. Gent 
et al. (2001) proposed the flawless random binary CSP based on the notion of a flawless 
conflict matrix. Instances of the flawless random CSP model are guaranteed to be arc- 
consistent, and thus do not suffer asymptotically from the problem of flawed variables. 
Achlioptas et al. (1997) proposed a nogoods-based CSP model and showed that the model 
has non-trivial asymptotic behaviors. Random CSP models with a (slowly) increasing 
domain size have also been shown to be free from the problem of flawed variables and have 
interesting threshold behavior (Xu & Li, 2000; Smith, 2001; Frieze & Molloy, 2003). 

While all of these proposed models are guaranteed to have interesting phase transition 
behavior (some of them are also guaranteed to generate hard instances at phase transi- 
tions), the fundamental relation between the structures and the typical-case hardness of 
randomly-generated CSP instances has not been seriously addressed. As a consequence, 
superficial conditions and restrictions on the parameter of constraint tightness in these 
models frequently give the (false) impression that it is the constraint tightness and/or the 
domain size that determines the behavior of random CSP instances. For the classic binary 
random CSP, the constraint tightness has to be less than the domain size in order to have 
a phase transition. The flawless random CSP does have a true solubility phase transition 
at a high constraint tightness, but as we will show later, it still suffers from embedded easy 
unsatisfiable subproblems when the constraint tightness is greater than the domain size. In 
CSP models with increasing domain size, there is still an obvious restriction on the possible 
values of the constraint tightness. In the nogood-based CSP model, it is impossible to have 
a high constraint tightness without making the constraint (hyper) graph very dense. 

In this paper, we study the possibility of designing non-trivial random CSP models 
by exploiting the intrinsic connection between structures and typical-case hardness. For 
this purpose, we show that consistency, a notion that has been developed to improve the 
efficiency of CSP algorithms, is in fact the key to the design of random CSP models that have 
interesting phase transition behaviors and guaranteed exponential resolution complexity 
without putting much restriction on the parameter of constraint tightness or the domain 
size. 

In Section 4 we report a scries of experimental studies. Algorithmic studies on random 
instances are sometimes criticized for being more about the model than about the algo- 
rithms (Johnson, 2002). We believe that it is better to study the interaction of models with 
algorithms. For example, in Section 4.2 we observe a double peak phenomenon where our 
best guess suggests it is related to the (lack of) consistency enforcement in the algorithms 
we use interacting with the degree of consistency enforced in the instances by the various 
models. We propose a very flexible framework for constructing problem instances with in- 
teresting behavior and develop a variety of concrete methods to construct specific random 
CSP models that enforce different levels of constraint consistency. We hope this framework 
will be useful in helping researchers construct pseudo-random instances exhibiting various 
kinds of structure, while avoiding trivialities that have sometimes tainted claimed results 
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in the past, or that may cause algorithm design to become too focused on a particular 
structure to the detriment of a more general application. 

2. Random Models for CSPs 

Throughout this paper, we consider binary CSPs defined on a domain D with \D\ = d. A 
binary CSP C consists of a set of variables x = {xi, • • • , Xn} and a set of binary constraints 
{Ci, • • • , Cm}- Each constraint Cj is specified by its constraint scope, a pair of variables in 
X, and a constraint relation that defines a set of incompatible value-tuples in D x D 
for the scope variables. An incompatible value-tuple is also called a restriction, or a nogood. 
Associated with a binary CSP is a constraint graph whose vertices correspond to the set 
of variables and whose edges correspond to the set of constraint scopes. Throughout the 
discussion, we assume that the domain size is a fixed constant. In the rest of the paper, we 
will be using the following notation: 

1. n, the number of variables; 

2. m, the number of constraints; 

3. d, the domain size; and 

4. t, the constraint tightness, i.e., the size of the restriction set. 

Given two variables, their constraint relation can be specified by a 0-1 matrix, called 
the conflict matrix, where an entry at {i,j) indicates that the tuple {i,j) & D x D is 
incompatible. Another way to describe a constraint relation is to use the compatibility 
graph, a bipartite graph with the domain of each variable as an independent part, where an 
edge signifies the corresponding value-tuple is compatible. 

We now define our generic model for generating random binary CSPs. Recall that a 
relation is a set of ordered pairs. 

Definition 2.1. {BrifmijCl^ Restricted Random Binary CSP) Let d,t,n,m be integers 
as specified above, all variables share a common domain D = {!,..., d}, and let C = 
{Li,L2, ... I Lj C -D X D} he a set of relations. Bnfm[^] is a random CSP model such that 

1. its constraint graph is the standard random graph G{n,m) where the m edges of the 
graph are selected uniformly at random from all possible (2) edges; and 

2. for each edge of G{n,m), a constraint relation on the corresponding scope variables is 
specified as follows: 

(a) Choose at random a relation L ^ C. 

(b) Choose t value-tuples from D x D\L uniformly at random as its nogood set. 
To the above definition, we typically want to add the condition that the set C satisfies 

(a, a) ^ Pi Li,\la G D. 
i>l 
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If this condition is not met, then the instances generated by the CSP model are always 
trivially satisfiablc (and vacuously have exponential resolution complexity). This is a very 
mild, but sufficient, condition to make sure that i3n,m['C] has a linear solubility threshold. 
In particular, all the existing models and all the models to be discussed in the present 
paper satisfy the condition. In Appendix A.3, we provide a result showing that under 
this condition, there is a constant c* such that Hn',m['C] with m/n > c* is asymptotically 
unsatisfiable with probability one. 

By placing other restrictions on L we can subsume many of the previously existing 
models as well as the new models we will define shortly. 

Definition 2.2. (Model B: Random Binary CSP B'kU) IfforBn%[C] we haveC = {(/>}, 
we write Bnfm, which is known in the literature as Model B. 

An instance of a CSP is said to be k-consistent if and only if for any {k — 1) variables, 
each consistent {k — l)-tuple assignment to the {k — 1) variables can be extended to a 

consistent A;-tuplc assignment to any other kth. variable. A CSP instance is called strongly 
k-consistent if and only if it is j-consistent for each j < k. Of special interest are the 
strong-k-consistencies for k = 1,2,3, also known as node- consistency, arc- consistency, and 
path- consistency (Mackworth, 1977). 

Note that in Bnjrn there are no restrictions on the construction of the constraint. It has 
been shown that Bn,m is asymptotically trivial and unsatisfiable for t > d, but has a phase 
transition in satisfiability probability for i < d (Achlioptas et al., 1997; Gent et al., 2001). 
Basically, the trivial cases arise because asymptotically with high probability there will be 
a variable such that all of its values arc forbidden due to constraints containing it that are 
not arc-consistent. This motivates the introduction of the flawless conflict matrix to make 
sure that the random model is arc-consistent (Gent et al., 2001). 

Definition 2.3. (Flawless Random Binary CSP Bifm[M]) If for BnL[C] each L C 
is a bijection, L : D ^ D, then we write Bi^rn[M.]- This is the flawless random binary CSP 
as defined by Gent et al. (2001). 

The M in the notation is a reference to matchings. If we use the construction for the 
generalized flawless random binary CSPs that we present in Section 3.2, but restrict the 
underlying graphs to be matchings, then it reduces to the flawless model. 

By specifying bijections we create a set of tuples such that there is one for each i £ D. 
These will not be considered when choosing incompatible value-tuples, and so the resulting 
model is guaranteed to be arc-consistent and consequently will not have flawed variables. 
However, even though the flawless random binary CSP Bn',m[A^] does not suffer from the 
problem of trivial unsatisfiability, it can be shown that Hn',m[A^] may asymptotically have 
embedded easy subproblems loi t > d — 1. 

Theorem 2.1. Fort > d—1, there is a constant c* > such that for any ™ > c* , with high 
probability BufmlM] is asymptotically unsatisfiable and can be solved in polynomial time. 

Proof. We outline the proof here and provide a detailed proof in Appendix A.l. The idea is 
to show that for sufficiently high ^, Bftjm[M] contains structures (sub-instances) that (1) 
cause the unsatisfiability of the whole instance and (2) can be detected in polynomial time. 
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One such structure is the so-called "flower" consisting of a collection of forbidden constraint 
cycles which wc explain below. 

A binary constraint over two variables xi and X2 is said to contain an (a, /3)-forcer if 
for the assignment xi = a, the only compatible assignment to X2 is X2 = P. Consider a 
CSP instance. Let {xo,a;i, • • • ,Xr-i} be a subset of r variables and a be a domain value. 
Suppose that there is a cycle of constraints 

Ci{xo,Xi),C2{xi,X2), . . . ,Cr-liXr-2,Xr-l), and (x^-l , Xq) 

such that Ci{xo,xi) contains an (a, Q!i)-forcer, Ci{xi-i,Xi) contains an (cij-i, ai)-forcer, 
and Cr{xr^i,XQ) contains an (a^-i) /5)-forcer with a ^ (3. Then, the assignment xq = a 
cannot be used in any satisfying assignment. We call such a cycle of constraints a forbidden 
cycle (for the variable xq and domain value a). Now, if there are d (the domain size) 
forbidden cycles for the variable xq, one for each domain value, then the CSP instance is 
not satisfiable since none of its domain values can be in a satisfying assignment. We call a 
variable together with forbidden cycles for each a a flower. 

Using the second-moment method, it can be shown that there is a constant c* such 
that for any ^ > c*, the probability that H^',m[A^] contains a flower is asymptotically one. 
So, with high probability, Hn',m[A^] is unsatisfiable if ^ > c*. Furthermore, if a binary 
CSP instance contains a flower, then any path-consistency algorithm (e.g., Mackworth, 
1977) will produce a new CSP instance in which the variable of the flower has an empty 
domain. Therefore, a CSP instance with an embedded flower sub-instance can be solved in 
polynomial time. □ 

It should be noted that Hn'^[A^] does have a non-trivial phase transition since it is 
satisfiable with high probability if ^ < \. Theorem 2.1 does not exclude the possibility 

that Bnfmi-M] will be able to generate hard instances when ™ is below the upper bound c*, 
in particular in the case of a large domain size. Further investigation is required to fully 
understand the complexity of ;Sn,m[-M] in this regard. 

3. Consistency, Resolution Complexity, and Better Random CSP Models 

Prepositional resolution complexity deals with the minimum length of resolution proofs for 
an (unsatisfiable) CNF formula. As many backtracking-style complete algorithms can be 
simulated by a resolution proof, the resolution complexity provides an immediate lower 
bound on the running time of these algorithms. Since the work of Chvatal and Szemeredi 
(1988), there have been many studies on the resolution complexity of randomly-generated 
CNF formulas (Beame et al., 1998; Achhoptas et al., 2001). 

Mitchell (2002b) developed a framework in which the notion of resolution complexity 
is generalized to CSPs and the resolution complexity of randomly-generated random CSPs 
can be studied. In this framework, the resolution complexity of a CSP instance is defined 
to be the resolution complexity of a natural CNF encoding which we give below. Given an 
instance of a CSP on a set of variables {xi, • • • , Xn} with a domain D = {1, 2, • • • , d}, its 
CNF encoding is constructed as follows: 

1. For each variable Xi, there are d Boolean variables Xi : IjXi : 2,...,Xi : d each of 
which indicates whether or not Xj takes on the corresponding domain value. There is 
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a clause : 1 V : 2 V . . . V : d on these d Boolean variables making sure that Xj 
takes at least one of the domain values; 

2. For each restriction {ai, ■ ■ ■ ,ak) G D'^ of each constraint C(xi^, • • • ,Xif,), there is a 
clause Xi^ : ai V • • • V : to respect the restriction. 

This CNF encoding is equivalent to the original CSP problem in the sense that a CSP 
instance is satisfiable if and only if the CNF encoding is satisfiable. Any satisfying assign- 
ment to the CSP variables translates immediately to a satisfying assignment of the CNF 
encoding. On the other hand, any satisfying assignment to the CNF encoding can also be 
translated to a satisfying assignment to the original CSP instance. If in the CNF assign- 
ment, more than one of the d boolean variables Xi : l,Xi : 2, . . .Xi : d are assigned TRUE, 
we can just pick any one of them and assign the corresponding domain value to the CSP 
variable Xj. It is possible to add another set of 2-clauses for each constraint variable Xj of 
the form {xj : aj V Xj : a/j | 1 < j < /t < d} to ensure that any satisfying assignment to the 
CNF encoding assigns a unique domain value to each CSP variable. However, these do not 
in general change the complexity results, and make the analysis more complicated. 

Mitchell (2002b) as well as Molloy and Salavatipour (2003) showed that random CSPs 
will have an exponential resolution complexity if their constraint tightness t is less than 
a certain value. For random binary CSP Bnfm, the requirement is (1) t < d — 1; or (2) 
t < d and ^ is sufficiently small. For t > d — 1, recent theoretical results (Gao & Culber- 
son, 2003; Molloy &: Salavatipour, 2003) indicate that it is still possible for these classical 
models to have an asymptotic polynomial complexity due to the existence of embedded easy 
subproblcms. 

In the following, we will show that it is not necessary to put restrictions on the constraint 
tightness in order to have a guaranteed exponential resolution complexity. Based on similar 
arguments as those in the literature (Mitchell, 2002b; Molloy &; Salavatipour, 2003; Beame, 
Culberson, Mitchell, & Moore, 2005), it can be shown that if in Bn'mf-C], the constraint 
relation of each constraint is chosen in such a way that the resulting instances are always 
strongly 3-consistent, then Bnfm[j^ has an exponential resolution complexity no matter how 
large the constraint tightness is. 

Theorem 3.1. Let C he such that Bn^m[C\ is strongly 3-consistent. Then for any constant 
^ = c > 0, the resolution complexity of Bnfm[^] is almost surely exponential. 

Proof. See Appendix A. 2. □ 

Using the tool developed by Molloy and Salavatipour (2003), the requirement that CSP 
instances be strongly 3-consistcnt to have an exponential resolution complexity can be 
further relaxed. Recall that a constraint on xi,X2 contains an (a, P) -forcer if for xi = a 
the only consistent assignment allowed by the constraint is X2 = /3. 

Definition 3.1. We call a CSP instance weakly A-consistent if 

1. it is arc-consistent, 

2. it contains no forcer, and 
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3. for any 4. distinct variables xi, X2, X3, X4 and constraints Ci{xi,X2),C2{x2,xz),C^{x^,Xi) 
and for any ai,a4 £ D there exist Q^s £ D such that the assignment xi = ai,X2 = 
a2,X3 = a3,X4 = 04 is consistent with Ci, C2 and C3. 

Note that weak 4-consistcncy is weaker than strong 3-consistency since it does not 
require an instance to be 3-consistent. 

Theorem 3.2. Let L he such that BnfmiJ--] is weakly A- consistent. Then for any constant 
^ = c > 0, the resolution complexity of Bnfm[j^ is almost surely exponential. 

Proof. See Appendix A.2. □ 

The proofs of the above theorems arc based on the structural properties of the underlying 
constraint random graph of the CSP models and the relation between the size of a resolution 
proof and the maximum size of a clause in the resolution proof. Below we briefly discuss 
the key ideas of the proof of Theorem 3.1 to illustrate the role of constraint consistency. 
First, in order for a CSP instance to have an exponential size proof, the size of a minimum 
unsatisfiable sub-instance (MUS) must not be constant. Otherwise, an enumeration of all 
the partial assignments to all the sub-instances of size less than that of the MUS gives us 
a proof of polynomial size. In the case of Bn%,[jC-], the minimum vertex degree of a MUS 
must be larger than 2 since it is 3-consistent. Due to the local sparseness of a random 
graph forced by the assumption ^ = c, the minimum size of a MUS has to be linear in the 
problem size n. 

Second, Ben-Sasson and Wigderson (2001) have shown that to establish an exponential 
lower bound on the resolution complexity, it is sufficient to show that any resolution proof 
must contain a clause whose size is linear in the problem size. Let s be the minimum size 
of a MUS. It can be shown that any resolution proof must contain a clause C such that the 
minimum size of the sub-instance J that implies C is at least |. Since the instance is 3 
consistent and J' is minimal, the clause C must contain one literal related to each of the 
variables in J' whose vertex degree is less than 3. A random graph argument shows that the 
number of such variables in any sub-instance is linear in n. In summary, due to constraint 
consistency and the local sparseness of a random constraint graph, any resolution proof 
must contain a clause whose derivation involves a linear number of variables. 

The question remaining to be answered is whether or not there are any natural random 
CSP models that arc guaranteed to be strongly 3-consistent or weakly 4-consistent. In fact, 
the CSP-encoding of random graph fc-coloring is strongly ^-consistent. In the rest of this 
section, we discuss how to generate random CSPs with a high tightness that are strongly 
3-consistent or weakly 4-consistent. 

3.1 The Generalized Flawless Random Binary CSP 

We now introduce our generalized flawless model. We call this a generalized flawless model 
since it requires even stronger conditions on jC than the flawless model of Gent et al. (2001). 
We assume a common domain D = {1, . . . ,d},d > 2. 

Definition 3.2. We say that L is SC-inducing if 

1. ML G C and Ma G D there exist (3,^ £ D such that (a, (5) £ L and (7, a) G L and 
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2. VLi,L2 G C and'ia,^ G D there exists (3 E D such that (a,/3) G Li and (/?, 7) G i^2- 

The first condition in Definition 3.2 guarantees arc consistency in any constraint. The 
second condition ensures 3-consistency. Given d > 2, these are the two conditions for strong 
3-consistency. 

Definition 3.3. We say that L is WC-inducing ij 

1. VL G C and Va G D there exist /3i , /32 , 71 , 72 G D such that (a,/3i), {a, 132) G L and 
(71, a), (72, a) G L anc? 

5. VLi, L2, L3 G £ and Va, S e D there exist 7 G D swc/t i/tat (a, /?) G Li, (/?, 7) G L2 
and (7, 5) G -L3. 

The first condition in Definition 3.3 is to prevent forcers as well as to enforce arc con- 
sistency, which are the first two conditions required by the definition of weak 4-consistency. 

Definition 3.4. (Generalized Flawless Random Binary CSPs) 

For the model Bnfm[^] if C, is SC-inducing then we write Bnfm[SC\- If JC is WC-inducing 
then we write Bnfrn[WC\. In either case, we say we generate a generalized flawless random 
binary CSP. 

Lemma 3.3. A CSP constructed by Btfm[SC\ is strongly 3-consistent. A CSP constructed 
by Bnjm[WC\ is weakly A- consistent. 

We should note that just as the flawless method could not generate every arc consistent 
CSP (Gent ct al., 2001) not every SC (WC) CSP will be generated by our system. For 
example, if there is a constraint C on variables xi and X2 then with respect to a third 
variable X3 strong 3-consistency requires only that there be a consistent assignment to 0:3 
for those pairs of values that are compatible in C . Our method ensures consistency on every 
pair of values for xi, X2 independently of whether or not there is a constraint on them. We 
see no reasonable way to produce the relaxed version given that the constraints have to be 
generated independently of G{n,m). 

3.2 Constructing Consistent-Inducing Sets jC 

We now provide methods to generate a variety of sets of relations C which are SC-inducing 
or WC-inducing. Although our CSP models require all variables to have a common domain, 
we will start with a more general construction on varying domain sizes. This will enable 
us to significantly increase the domain tightness on larger domains. As a side benefit, it 
indicates how the models might be adapted to varying domains. However, to build a model 
for varying domain sizes would require at least that we have distinct sets L for every ordered 
pair of distinct domains. It would also complicate the selection step, 2(a) in Definition 2.1 
as well as the analysis. We leave further development of such models for future research. 

We start by constructing a set of bipartite graphs G, which we call the core graphs. 
For each pair of domains and Dr with sizes di = \D£\,dr = \Dr\, there will be one 
or more core graphs G £ Q. Each graph G G ^ is of the form G = (Vg DVr,E) where 
V( = {vi, . . . , Vd,^} and Vr = {wi, . . . , w^^} are sets of vertices and E C V( x Vr. For each 
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graph G we create a set of size kc of triples of the form T{G) = {{G, tth, iTri) | 1 < i < ka}, 
where vr^j : Vi ^ and 7r,.,j : Vr ^ are bijections labcUng the vertices. 

We define T = T{g) = [JGegT{G). For each T G T, where T = (G, vr^, vr,-), we let L{T) 
be the set of pairs {{7:e{v),7rr{w)) \ {v,w) G E{G)}. We define £ = C{T) = {L{T) | T G T}, 
the set of relations induced by the triples. 

Given a bipartite graph G = (VeUVr, E), define 9i to be the minimum degree of a vertex 
in the vertex subset Vi where i G {i, r} . 

We say that Q is degree bound if for each graph Oi > ^ and 9r > y- 

Lemma 3.4. If Q is degree bound then for any Ti,T2 G T sharing the domain D = D^i = 
De2> for any a G -0^1,7 G Dr2, there exists (3 E D such that (a,/3) G Li = L(Ti), 7) G 
L2 = L{T2). 

Proof. Let v = 7r£^^(a) be the vertex in Gi labeled by a. Since deg(t;) > On, the number 
of pairs in Li which have a as their first element must be greater than 5. Similarly, the 
pairs in L2 with 7 as the second element must also cover more than 1/2 of D. Thus, there 
must exist P such that {a, (3) G Li, (/3,7) G L2. □ 

Corollary 3.5. If Q is degree bound on a common domain D then C{T) is SC-inducing. 

It is not hard to see that without restricting the bijections Lemma 3.4 provides the best 
possible result. 

Lemma 3.6. IfT is not degree bound and arbitrary bijections tt^ and iVr are allowed, C{T) 
may not be SC-inducing. 

Proof. Consider two relations Li and L2 on a common domain D with graphs Gi , G2 such 

that Oil < 2 and 0^2 ^ 2- Let a G and P G D4 and note that the neighbors of the vertices 
labeled by these values will each cover at most 1/2 of D. Since we can choose VTri and 77^2 
arbitrarily, we can ensure that there is no value for X2 consistent with xi = a, X3 = /3. □ 

Consider the graph on a domain with d = 9 shown in Figure 1. If we allow arbitrary 
bijections 7re,nr then C is not SC-inducing, but it is WC-inducing, which the reader may 
readily verify. The degree of the vertices in this graph is ^ = 4, which is less than that allowed 
by Lemma 3.4. It remains WC-inducing because the structure of this graph distributes the 
edges in a uniform way. 

To generalize this observation, assume a common domain D and consider three con- 
straints induced by Li,L2,Ls built from some core graph set Q. If we have minimum 
degree 6 then for any value on the left of Li there will be 9 values on the right of Li, and 
if we allow arbitrary bijections, these may match any subset of size 9 on the left of L2. 
Similarly, any value on the right of L3 will match an arbitrary set of 9 values on the right 
of L2. To ensure that Q generates an £. that is WC-inducing, all we need is that any such 
arbitrary pair of 9-scts induces at least one edge in any G £ Q. If any pair of subsets fails 
to induce an edge on some core graph, then the corresponding constraint set is not weakly 
4-consistent. 

Letting Ng{v) be the neighbors of vertex v in graph G, this argument generalizes to the 
formal statement 
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Lemma 3.7. On a common domain D, if^G G Q the minimum left and right degree is 
= 0J. = 6*, then Q generates a WC-inducing C using arbitrary bijections Tr^jTr^ iff^G G Q, 
forie {e,r} and for all S C Vi,\S\ = 0,\[j^^s Ng{v)\ >d-e. 

A graph is 6-regular if every vertex is of degree 9. A graph is connected if there is a 
path between every pair of vertices. Notice that in a degree bound Q each G is necessarily 
connected. The following is a special case where G will generate WC-inducing but not 
SC-inducing £.. 

Corollary 3.8. Suppose we have a common domain D with d = 26 for some 9 and Q is 
a set of 9 -regular connected bipartite graphs. Let T be a set of triples generated by random 
pairs of bijections on graphs in Q . Then C{T) is WC-inducing. 

Proof. By Lemma 3.7, we must show that for an arbitrary subset S d where IS"! = 9, 
the neighbors of S contain more than 9 vertices. Suppose not. Note that we must have 9'^ 
edges on the vertices in 5*, so if there arc 9 other distinct endpoints (there cannot be fewer) 
each of these has 9 edges. It follows that these are the only edges on the neighbors, but 
then G is not connected. □ 

Lemma 3.7 can be used to show that 9'^ > d. It also follows easily that G must be 
connected. These bounds are met at d = 4, ^ = 2 using the eight cycle as illustrated in 
Figure 1. For larger d we mostly leave the problem of optimizing as a research project 
in combinatorial design, although we will provide some insight in the next section. A few 
other WC-inducing graphs are shown in Figure 1. 

3.3 Recursive Construction 

By Lemmas 3.4 and 3.6 if we do not introduce additional restrictions on the bijections, the 
best tightness we can achieve while maintaining strong 3-consistency \s t < If we only 

require weak 4-consistency, then Lemma 3.7 may allow significant improvement, although 
it is unclear exactly how much in general. We will now show how to increase tightness by 
reducing the number of edges in the graphs using a recursive construction for larger domains 
that simultaneously restricts the set of bijections. 

Given a domain D we partition it into a set of blocks 11 = {Hi, . . . , 11^} subject to A; > 3 
and |IIj| > 3,Vj. These latter restrictions arc in place because the only SC-inducing graphs 
on pairs of size two are the complete bipartite graphs. The blocks Hj may vary in size. 

Each variable xi must have exactly one partition Ilj of its domain which will be used in 
the construction of all constraints on Xi. For problems with a common domain it is simplest 
to have just one partition of the common domain for all variables, which is the construction 
we will now assume. 

Let V = {vi, . . . , Vd}. A bijection tt : F D is said to respect the partition 11 if there is 
a bijection tt* : 11 ^ 11 on the partition such that for each j,l < j < k and for each i G Ilj 
we have 7r{vi) G 7r*(nj). Note that to meet this condition, vr* must satisfy the condition 
llljl = |7r*(nj)| for each j. Thus, partitions with varying sizes of blocks will further restrict 
the set of respectful bijections. 

We construct a H-bound graph G as follows. Let Vm = {v[, . . . ,v'i^} and Vur = 
{w'l, . . . ,w'i^} be sets of vertices corresponding to the blocks of the partition. Let Gjj = 
U Vur, -E-n) be an SC-inducing graph. 
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Figure 1: Graphs that are WC-inducing but not SC-inducing for domains of size 4,5,6 and 
9. Notice that for d = 5 the minimal graph is not regular. The boxes in the graph 
for d = 9 indicate the recursive construction as described in Section 3.3. See also 
Figure 2. 



We assume as in the general construction two sets of vertices = {vi , . . . , v^} and 

Vr = {wi, . . . , Wci}- For each edge {v, w) G Ejj we construct a degree bound graph G^w = 
{Ve[U„] U Vr[U^aj],E„.^^), where Vf\U.,] = {v, ■ i G nj and V;.[n,,,,] = {uh : i G H^}. Recall 
that degree bound graphs work even when the domains arc of differing sizes. We construct 
the Il-bound graph G = {V(UVr,E) by letting E = U(y 

Lemma 3.9. IfG consists of Il-bound graphs and all bijections nijiTr respect Tl, then C is 
SC-inducing. 

Proof. Consider two relations Li,L2. Since block graphs Gn are SC-inducing, it follows 
that there is a path between any block on the left graph and any block on the right under 
the block permutations tt* implied by respectful bijections. But now focusing on the edges 
and bijections restricted to the blocks on such a path, since these induce degree bound 
graphs, the proof is completed by an argument similar to the proof of Lemma 3.4. □ 

Figure 1 shows a graph generated by this method for d = 9. (For d = 3, there is only 
one minimal degree bound graph up to isomorphism.) Figure 2 shows the construction of 
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Figure 2: An illustration of a strongly 3-consistent pair constructed using a 3 x 3 recursive 
construction on a domain of size 9. For simplicity, we use identity bijections on 
the far left and right. In the center, the dashed lines represent the bijection on 
the shared domain induced by iTri and 7r£2- T^ri and tt£2 respect the blocks, but 
rearrange the values at both the block and vertex levels. We highlight value 1 on 
the left and the vertices connected to it up to the left side of L2, where there are 
2 vertices in each of two blocks. These are sufficient to connect to any value on 
the right. 



two constraints using two copies of this graph. The bijections respect the block structure, 
and so the constraint is SC-inducing. 

For very large domains, we notice that wc may have many blocks in 11. Since Gji is 
SC-inducing, we can recursively use the construction to build Gu- We call this construction 
the recursive Il-bound construction. For example, similar to the graph for d = 9 in Figure 1, 
we see that for domains of size d = 3* this recursive construction lets us use graphs with 
degree 2*. Thus, tightness can be as high as — d^+^°^3^ on SC-inducing graphs. 
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We note that it is also possible to recursively partition very large blocks, resulting in 
possibly non-uniform depth recursive constructions, provided all permutations are block 
respecting in the appropriate sense. We leave this as an exercise as the notation gets messy, 
and applications appear limited. 

We also claim without proof that if all blocks in II are of the same size, and we make 
Gji WC-inducing and replace the degree bound subgraphs with WC-inducing graphs, then 
provided we use Il-respecting bijections, Q is WC-inducing. We call such graphs weakly 
Il-bound. As before, we can recursively construct Gn to get recursive weakly Il-bound 
graphs. For d = 4* using a recursion based on the d = 4 case in Figure 1, we can have weak 
4-consistency where the core graphs have degree 2*; that is, tightness in this case can be as 
high as - d^/^. 

Although there is increased tightness using these recursive constructions, there is a down 
side in that we greatly reduce the set of allowed bijections. For example, to respect the 
block structure of the graph on d = 9 shown in Figure 2, we have at most (3!)^ = 1296 
distinct bijections on either side, compared to 9! = 362880 unrestricted bijections. We also 
note that the recursive constructions have limited practical application since it is expensive 
to express tight constraints in an explicit way for such large domains. 

3.4 Limits to Tightness and Complexity Tradeoffs 

Can we get even tighter constraints with exponential resolution complexity? The discussion 
so far indicates we are near the limits using our consistency inducing constructions, so we will 
need something else. The following construction illustrates that it is possible to maintain 
exponential complexity while having even higher tightness, but it is trivial in the sense that 
it is really only exponential in a linear sized embedded subproblem. 

Let us embed 3-coloring in a domain of size d > 3. If we wish to construct a CSP version 
of fc-coloring for graphs, we start with a coloring domain K = {1, . . . , fc}. We create one 
graph G = {ViLlVr,E) e g, with edge set E = VexVr\ {{v,, w,) \ l <i < k}. We restrict 
the bijections to TT£{vi) = i, iTriwi) = i,l < i < k, which we refer to as the identity bijections 
or identities, ti, Lr for short. So there is only one relation in C = {L{G, Li, i^)}- To complete 
the fe-coloring implementation, we set t = k, the maximum possible value for t. To embed 
3-coloring in a domain of size d, let G be constructed as in the 3-coloring construction on 
the sub-domain {1,2,3} and then pad G with independent vertices {vi,Wi \ 4 < i < d}. We 
construct one triple (G, i£, tr) and again we have one relation in C. We set tightness to its 
maximum possible value, t = d^ — 6. 

We conjecture this to be the maximal possible tightness with exponential complexity, 
but it is of little relevance for larger d. For this £, on any instance from BufmijC] any 
reasonable algorithm will simply reduce it to 3-coloring, and will thus exhibit the exponential 
complexity of the 3-coloring threshold. 

We see that maximizing tightness is certainly not the only consideration, and probably 
not the most important. We feel it is more important to consider the kinds of structure that 
one can embed in a model, while avoiding generating trivial instances. Our model expands 
the framework wherein this can be done over previous models. 
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3.5 Implementation Issues 

The experiments conducted by Gao and Culberson (2004) and many of those in this paper 
use a domain size of d = 4 and the 8 cycle WC-inducing graph shown in Figure 1. We 
should point out that in the experiments conducted by Gao and Culberson (2004) we fixed 
the bijection tt^ = n and randomly assigned only the right hand set tt^. Although this is 
adequate to satisfy the WC-inducing conditions, it does not include all possible L, since for 
example for any L generated that way there will be an a such that both (l,a) and (2, a) 
are in L. If during the construction of constraints the variables are ordered in a consistent 
way, for example with the smaller index always on the left side of the constraint, then this 
may introduce an overall asymmetry in the CSP. 

In the following, we report a new series of experiments in which we used the randomly 
paired bijection approach. 

The best degree bound graphs, in the sense of maximizing t, arc those in which all 
vertices are of the minimum possible degree, that is ^-regular graphs. For our purposes, 
there is no requirement that these graphs be uniformly selected from all possible such 
graphs, and it is not too hard to find various techniqiies to get a variety of such graphs. For 
graphs conforming to Corollary 3.8 for example, one can first generate a cycle alternating 
between vertices oiV^,Vr and then choose a set of maximum matchings from the remaining 
unused pairs to reach the required degree. These can be found using any of a variety of 
techniques, for example by finding a 1-factorization of a regular bipartite graph (Bondy & 
Murty, 1976, Chapter 5). Should one desire to have a uniform sampling for some reason, 
there is a significant literature on generating random regular graphs of various types, which 
might serve as a starting point (Wormald, 1999). 

4. Experiments 

In this section, we discuss a series of experiments we have conducted to study the impact 
of the structural elements introduced in the generalized fiawless model on the typical case 
hardness of problem instances of practical size. In Section 4.1, we discuss an experiment 
on a Boolean 3-ary CSP model obtained from the widely-used random distribution of CNF 
formula. In Section 4.2, we report detailed experimental results on the comparison of three 
different random CSP models. 

4.1 Randomly-generated 3-ary Boolean CSPs 

The observation from this set of experiments on Boolean CSPs with different constraint 
tightness partly motivated this study on the interplay between the performance of backtrack- 
ing algorithms and the structural information in the seemingly "structureless" randomly- 
generated instances. We include the results here since they provide a nice illustration of the 
effect of an increase in the constraint tightness (hence an increase in the likelihood of the 
existence of a forcer in a constraint) on the typical case hardness of random CSP instances. 

To obtain an instance distribution for the ternary boolean CSP, we start with a random 
3-CNF formula in which each clause is generated by selecting three variables uniformly 
at random without replacement. A clause in a CNF formula is equivalent to a constraint 
with constraint tightness t = 1 over the same set of variables. E.g., xy y\J z corresponds 
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to the restriction {(0,0, 1)}. We can therefore obtain random instances of Boolean-valued 
CSPs with a larger constraint tightness by adding more clauses defined over the same set 
of variables. 

Let J^{n,m) be a random 3-CNF formula with n variables and m clauses selected 
uniformly at random without replacement. We construct a new random 3-CNF formula 
jr(n, m, a) as follows: 

1. J^{n, m, a) contains all the clauses in J^{n, m); 

2. For each clause C in T{n, m), we generate a random clause on the same set of variables 
of C, and add this new clause to J^{n, m, a) with probability a. 

In fact, J-{n, m, a) is the random Boolean CSP model with an average constraint tightness 
1 + a and has been discussed by Gao and Culberson (2003). For a > 0, it is easy to see 
that J^{n, m, a) is always strongly 2-consistent, but is not 3-consistent asymptotically with 
probability 1. 




Figure 3: Thresholds for the solution probability in the model J^{n, m, o) with n = 250. 

The z-axis is the solution probability. The axis with the range 1.0 ... 2.0 is for 
the parameter 1 + a and the axis with the range 1.0 ... 6.0 is for the clause density 
m/n. 

Figure 4 shows the median of the number of branches used by the SAT solver ZChaff 
(Zhang et al., 2001) on 100 instances of jr(n, m, a) with n = 250. Figure 3 shows the 
solution probability of the same model. As expected, an increase in the tightness results 
in a shift of the location of the hardness peak toward smaller m/n. More significant is the 
magnitude of the decrease of the hardness as a result of a small increase in the constraint 
tightness. The instances hardness varies so dramatically that it is hard to illustrate the 
difference for all the constraint tightness values from t = 1.0 to t = 2.0 using either the 
original scale or the log scale. This explains the scale scheme used in Figure 4. 

The reason for such dramatic change can be explained as follows. If we randomly gener- 
ate constraints with tightness t > 1 then for each constraint, there is a positive (fixed) 
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Figure 4: Effects of an increase in the constraint tightness on the instance hardness for 
J-{n, m, a) with n = 250. The z-axis is the median number of branches with the 
first three highest peaks scaled to 1/40, 1/10, and 1/4 respectively. The axis with 
the range 1.0 ... 1.8 is for the parameter 1 + a, < o < 0.8 and the axis with the 
range 2.5 ... 6 is for the clause density m/n. 



probability that the restriction set will be something such as {(1, 0, 0), (0, 0, 0)}. This 
constraint is not 3-consistent, as the partial assignment y = 0,z = has no consistent 
extension to x. In effect, this constraint induces the binary constraint on y, z with the re- 
striction {(0, 0)}. If there are enough such constraints, then the induced binary constraints 
create an unsatisfiable subproblem, and since the domain is also Boolean, this subprob- 
lem can be detected in polynomial time. Upper bounds on m/n have been established for 
the existence of the easy unsatisfiable subproblems in T{n,m,a). For example, we know 
that the upper bounds on m/n for T{n,m,a) to have an exponential resolution complex- 
ity are respectively 23.3 if a = 0.1 and 11.7 if a = 0.2 (Gao & Culberson, 2003). Since 
the ratio of constraints to variables m/n considered in the experiment are well below these 
bounds above which embedded 2SAT subproblems appear with high probability, it seems 
that the impact of forcers on the instance hardness goes beyond simply producing higly- 
structured 2-SAT-like embedded easy subproblems. We will see a similar effect in the next 
subsection on non-Boolean valued CSP instances. 

4.2 Random Binary CSP Models: the Significance of Structures 

This set of experiments is designed to investigate whether introducing structural elements 
that enforce constraint consistency in random CSPs leads to a significant increase in the 
typical case hardness of instances of practical size. It should be mentioned that the purpose 
of the experiments is not to compare and rank the relative merits of the solvers we have 
used in the experiments. Neither is it our intention to exhaust all the available solvers and 
implementation techniques to solve this set of problem instances. 
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The three random CSP models we consider are Btfm (the Model B), B^^miM] (the 
flawless model), and the generalized flawless model Sn,m[-C] with different domain size and 
different consistency-inducing core graphs. Randomly-generated instances from these mod- 
els are encoded as CNF formulas and solved by the SAT solver ZChaff ^. Also included 
are some experiments on the comparison of ZChaff, SatZ (Li &; Anbulagan, 1997) ^, and a 
CSP solver based on forward checking (FC) and maintaining arc consistency (MAC) with 
a static variable order^. 

It looks unnatural that we have primarily tested random CSP instances by converting 
them to SAT instances and using a SAT solver to solve them. This is justified by the 
following considerations. First, all of the existing research on the resolution complexity 
of random CSPs has been carried out by studying the resolution complexity of a SAT 
encoding of CSPs as described in Section 3. We use the same encoding in the experiments. 
Secondly, it has been shown that as far as the complexity of solving unsatisfiable CSP 
instances is concerned, many of the existing CSP algorithms can be efficiently simulated by 
the resolution system of the corresponding SAT encodings of the CSPs (Mitchell, 2002a). 

Experimental comparisons were conducted for the three CSP models with domain size 
(d = 4, 5, 6, and 9) and different values of constraint tightness. The generalized flawless 
CSP models are constructed using the WC-inducing core graphs as described in Figure 1. 
In the next two subsections, we focus on the results for the domain size d = 4 and 9. The 
experiments were carried out on machines with AMD Athlon (tm) Processor (Model 3700, 
Frequency 2.4GHz, Main Memory 1GB with 1MB Cache) running Linux. The following 
setup was used in all of our experiments: the sample size for each parameter point is 100; 
the cutoff time for all the solvers is 1800 seconds of CPU time. 

4.2.1 The Case of d = 4 

For the case ofd = 4, the<*„[WC] we use in the experiment is based on the WC-inducing 
core graph shown in Figure 1, a connected 2-regular bipartite graph (or an 8-cycle). Note 
that for this core graph, the maximum possible constraint tightness is 8. 

To observe the detailed behavior of the models, we first fixed the constraint tightness 
to t = 6 and the number of variables to n = 500. Figure 5 plots the solution probability 
as a function of the ratio of constraints to variables for the three CSP models. From the 
experimental data, we observed that the phase transition of ;Bn,m [WC] is much sharper than 
those of Bifm[M] and Bifm^. 

More importantly, instances of Bnfm [WC] at the phase transition are clearly much harder 
than those of B^fm and Hn'^iA^]- Figures 6, 7, and 8 show the 50th percentile, 20th 
percentile, 10th percentile, and 5th percentile of the number of branches and running time 
in seconds for ZChaff to solve randomly-generated CSP instances from the three models. 
As can be seen, instances drawn from Btfm[^C] are consistently much harder than those 

1. Available at http://www.princeton.edu/"'chaff/zcliafr.html 

2. Available at http://www.laria.u-picardie.fr/~cli/EnglishPage.litml 

3. Available at http://ai.uwatorloo.ca/~vanbcck/softwarc/softwaro.html 

4. The reader should be aware that this model doesn't show a phase transition in the limit (as problem size 
approaches infinity). 
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Constraints-Variables Ratio 

Figure 5: Solution probability as a function of the ratio of constraints to variables for the 
three random CSP models with n = 500, t = 6. For the generalized flawless 
model, C is the set of connected 2-regular bipartite graphs. The y-axis is for 
the solution probability and x-axis for the ratio of constraints to variables m/n. 
Sample size for each data point is 100. 

from Bnjm and ;Sn,m[-M] in terms of both the size of the search tree and the running time 
and for satisfiable instances (Figure 8) as well as unsatisfiable instances. 

Careful readers may have noticed that for the model H^'^[WC], the number of branches 
has an obvious secondary peak before the phase transition (Figure 6 and Figure 8), an 
"anomaly" that we have reported in our previous paper (Gao & Culberson, 2004). A de- 
tailed look at the experimental data reveals that the secondary peak, though persistent in a 
statistical sense, appears only for the number of branches; We noticed that the measure of 
running time docs not have such a secondary peak. We have tried several approaches to un- 
derstand such a secondary peak, and finally conclude that it is a solver-dependent behavior 
caused the branching heuristics, the CNF encoding, and the level of the local consistency 
enforced by the CSP models. We provide some further discussion of this phenomenon in 
the next subsection. 

For the case of d = 4, we also studied the behavior of the CSP models with different 
constraint tightness values. The results are reported in Tables 1 and 2 where we show 
the maximum over all the ratio of constraints to variables of the median of the number of 
branches for the three CSP models with the constraint tightness ranging from t = 5 to 
i = 8. The results are consistent with those observed in the experiments on Boolean CSPs, 
showing that an increase in constraint tightness has a significant impact on the typical case 
instance hardness. It is also clear that for all the constraint tightness values, instances from 
the generalized fiawless model ;Bn,m [WC] are significantly harder than those from the two 
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Figure 6: Number of branches used by ZChafF to solve instances from the three random CSP 
models with n = 500, t = 6,d = 4. For the generalized flawless model ;Sn'm [WC], 
the core graph is the connected 2-regular bipartite graph described in Figure 1. 



classical models. In the case of i = 5, instances drawn from ;Sn'm[WC] cannot be solved 
within the time limit of 1800 seconds even before the solubility phase transition. 

To further confirm that enforcing constraint consistency increases the typical case hard- 
ness, we tested two other backtracking solvers in our experiments: SatZ and the CSP solver 
with FC + MAC and a static variable ordering. 

In this set of experiments, the problem size is n = 300 and we run all the solvers on the 
same instances to reduce variance. We summarize the results in Tables 3 and 4. 

For problem size n = 300, ZChaff and SatZ have little difficulty in solving randomly- 
generated instances from all the three models except that in some rare cases, SatZ can 
get stuck on instances generated with the ratio of constraints to variables well below the 
threshold. Although the CSP solver works directly on the CSP instances, it does not perform 
as well as the SAT solvers working on the encoded instances. Given the long history of 
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Figure 7: User CPU time in seconds to solve instances from the three random CSP models 
with n = 500, t = 6, d = 4. For the generalized flawless model ;Bn'm[WC], the core 
graph is the connected 2-regular bipartite graph described in Figure 1. 
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Figure 8: Number of branches used by ZChaff to solve satisfiable instances from the three 
random CSP models with n = 500, t = 6, d = 4. For the generalized flawless 
model Bn',m[WC], the core graph is the connected 2-regular bipartite graph de- 
scribed in Figure 1. 
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(n,t) 


branches 


m/n 


branches 


m/n 


branches 


m/n 


(500, 5) 


10673 


2.30 


20271 


2.90 


? 


> 3.10 


(500,6) 


7243 


1.80 


7446 


1.90 


94281 


2.44 


(500, 7) 


4440 


1.40 


5123 


1.50 


13813 


2.00 


(500,8) 


1853 


0.90 


4113 


1.20 


9163 


1.60 



Table 1: The maximum, over all — , of the median of the number of branches of ZChaff on 
100 random instances of the three random CSP models , where the domain size 
d = 4, tightness t = 5,6,7,8. For the generalized flawless model i3n',m[WC], the 
core graph is the connected 2-regular bipartite graph described in Figure 1. 





tod,t 
>-'n,m 




finfm [WC] 


{n,t) 


seconds 


m/n 


seconds 


m/n 


seconds 


m/n 


(500, 5) 


0.08 


2.80 


1.79 


2.90 


> 1800 


> 3.10 


(500,6) 


0.02 


2.00 


0.02 


1.90 


40.3 


2.44 


(500, 7) 


0.01 


1.40 


0.02 


1.70 


1.08 


2.00 


(500, 8) 


0.00 


0.90 


0.00 


1.00 


0.26 


1.60 



Table 2: The maximum, over all ^, of the median running time in seconds of ZChaff on 
random instances of the three random CSP models , where the domain size d = 4, 
tightness t = 5,6,7,8. For the generalized flawless model ;Sn,m [WC], the core 
graph is the connected 2-regular bipartite graph described in Figure 1. 



development of SAT solvers, with significant branch selection heuristics of SatZ and clause- 
learning (nogood recording) in ZChaff, this should not be surprising. In particular, the 
design oiBifmiWC] can be seen as rendering the consistency checking of the CSP solver 
less effective. Still, the better performance of the SAT solvers indicates there is still some 
structure to exploit. 

4.2.2 The Case of d = 9 

To further study the robustness of our method and to seek explanations on the double 
peaks, we consider the case of domain size d = 9. It turns out that for d = 9 we are able to 
construct a variety of random CSP models that have significantly different behavior. We 
consider a collection of 5 random CSP models: 

1. Biin (model B), 

2. Bifrn[M] (the flawless model), 
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ZChaff 


SatZ 


CSP (FC + MAC) 


(n,i) 


seconds 


m/n 


seconds 


m/n 


seconds 


m/n 


(300, 5) 


4.16 


3.10 


2.50 


3.10 


90.21 


3.10 


(300,6) 


0.33 


2.50 


0.31 


2.50 


12.11 


2.50 


(300, 7) 


0.11 


2.00 


0.11 


2.00 


3.10 


2.00 


(300, 8) 


0.03 


1.70 


0.03 


1.70 


0.60 


1.70 



Table 3: Maximum, over all — , of the median mimber of branches of ZChaff, SatZ, and a 
CSP solver with FC + MAC on random instances of ;B^'m[WC] , where n = 300, 
d = 4, t = 5,6,7,8, and the core graph of ;Sn,m [WC] is the connected 2-regular 
bipartite graph described in Figure 1. 





ZChaff 


SatZ 


CSP (FC + MAC) 


{n,t) 


seconds 


m/n 


seconds 


m/n 


seconds 


m/n 


(300,5) 


0.08 


2.90 


0.15 


2.90 


2.68 


3.00 


(300,6) 


0.01 


2.20 


0.02 


2.20 


0.29 


2.20 


(300, 7) 


0.01 


1.70 


0.01 


1.70 


0.09 


1.80 


(300, 8) 


0.00 


1.30 


0.01 


1.30 


0.06 


1.30 



Table 4: Maximum, overall ™, of the median running time in seconds of ZChaff, SatZ, and 
a CSP solver with FC + MAC on random instances of H^',m[A1], where n = 300, 
d = 4, and t = 5, 6, 7, 8. 



3. Hn'mi-Ci] where C\ is constructed using an 18 cycle (i.e., a connected 2-regular bipartite 
graph) with arbitrary bijections, 

4. Hn,m['C2] where C2 is constructed using the core graph shown in Figure 1 with arbitrary 
bijections, and 

5. Hn,m['C3] where £3 is constructed using the same core graph in Figure 1 with block- 
respecting bijections. 

Recall that the model Bnfm[^2] belongs to the class H^'m[WC], while Bn^^i-Ca] belongs to 
the class H^'^[SC]. For all of the five models, we fix the constraint tightness to t = 45, 
the maximum possible constraint tightness that can be achieved by a generalized flawless 
model with the WC-inducing core graph. 

These experiments show that the typical-case hardness of randomly generated instances 
increases with the level of consistency enforced. In Figure 9, we plot the median of the 
number of branches and the median running time in seconds for ZChaff to solve instances 
from Bnjm, Bnjmi-M]-, and Bnfm[j^i]- In Figure 10, we show the median of the number of 
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Constraints— Variables Ratio Constraints— Variables Ratio 

Figure 9: Results on using ZChaff to solve instances from i3n'm, Bi,fm[M], and Bn%.[Ci\ 
with Ci constructed by an 18 cycle (i.e., a connected 2-regular bipartite graph) 
and arbitrary bijections. The other parameters used are n = 300, d = 9, and t = 
45. For iSn'mi-Ci] with the parameter m/n = 2.3, ZChaff failed to solve 45 of the 
100 instances. The data (pointed to by solid triangles) for m/n = 2.3 are based 
on the solved instances only. 



branches and median of the running time in seconds for ZChaff to solve instances from 
^ln[C,2] and H^fml-Cs]. 

For Sn'm and Bn%.[.M], ZChaff is able to solve all the instances within the time limit of 
1800 seconds for all m/n. ZChaff starts to have frequent timeouts when m/n > 2.3. For 
Bnfmi^i] with m/n = 2.3, ZChaff can solve 55 of the 100 instances within 1800 seconds, 
and 10 of the 55 solved instances are unsatisfiable. At m/n = 2.4, 2.5, and 2.6, ZChaff can 
solve 42 instances (respectively, 81 instances, 90 instances) all of which are unsatisfiable. 

For the models 6n'.m [-C2] and;B„',m [-^^3] , wc observed that out of 100 instances generated 
with parameter m/n = 2.3, ZChaff can solve 90 instances from Bnjm[^2] and 33 instances 
from Bnfmi^s]- For Bnfm[^2] with m/n = 2.4, ZChaff only solved 5 instances. All the 
solved instances from ^^'^[£2] and Hn',m['C3] are satisfiable, which is to be expected since 
this region is well below their threshold. We didn't conduct experiments on ^^',^[£2] with 
m/n = 2.5 and 2.6 and Bnjmi'C-s] with m/n = 2.4,2.5 and 2.6, but expect that instances 
from these models will be even harder. 

Having assumed that the secondary peak observed in the CSP models with d = 4 is 
algorithm-independent and unique to our models, we had expected that using a larger 
domain size with a variety of CSP models would be able to help provide a satisfactory 
explanation of the phenomenon. We designed some experiments to empirically investigate 
the relations between the appearance of the secondary peak and three characteristics of our 
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Constraints— Variables Ratio Constraints— Variables Ratio 



Figure 10: Results on using ZChafF to solve instances from Bn%[^i], weakly 4-consistent 
model Bn^m['C-2] and the strongly 3-consistent model Bnjm[jCs]. The other param- 
eters used arc d = 9, n = 300, t = 45. ZChaff starts to have frequent timeouts 
when m/n > 2.3. Therefore for m/n = 2.3, the data (pointed to by solid tri- 
angles) are based on the solved instances only. For the model Hn'mf-Ci], 45 of 
the 55 solved instances are satisfiable and the remaining 10 solved instances are 
unsatisfiablc. For the weakly 4-consistent model Bnfrn[^2] and the strongly 3- 
consistent model Bnfmi^a], Zchaff solved 90 instances (respectively 33 instances) 
all of which are satisfiable. For the weakly 4-consistentmodel Bnfm[^2] with 
m/n = 2.4, ZChaff can only solve 5 of the 100 instances and these 5 instances 
are all satisfiable. 
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CSP models including (1) the number of derangements in the bijections in our constraint 
construction, (2) the level of the enforced local consistency of the models, and (3) the CNF 
encoding scheme. Based on a series of results, we conclude that the secondary peak is an 
algorithm-dependent artifact that is influenced by the selection of the branch heuristics, the 
level of local consistency enforced by the models, and the CNF encoding scheme. In the 
following, we briefly summarize these experimental investigations which, we hope, also helps 
illustrate the flexibility of our framework in constructing an ensemble of problem instances 
to study the behavior of different algorithms. 

1. The Number of Derangements. The number of derangements of a bijection n : V ^ D 
is the number of vertices Vi e V such that TT{vi) i. If there is no derangement in 
the bijections in our constraint construction, then the generated instances are always 
satisfied by the assignments that assign a common domain value to all the variables. 
Contrary to our initial assumption, we did not observe any significant impact of the 
number of derangements on the behavior of the solvers except for the extreme cases 
where we use bijections with no derangement with probability very close to one. 

2. The Local Consistency Level. The main difference among the five models with d = 9 
is the level of the enforced local constraint consistency. Btfm docs not enforce any 
consistency; Bn^m[-M.] enforces arc-consistency; i3n'rn[-^^i] guarantees that there is no 
forcer; Bnjn[C2\ generates instances that are weakly 4-consistent; and instances from 
Bn^m[i^z] are strongly 3-consistent. In addition to the obvious impact on instances 
hardness, our experiments indicate that the local consistency level in the models 
contributes to the appearance of the secondary peak of ZChaff. As depicted in Figures 
9 and 10, for the problem size n = 300, the secondary peak exists in all of the first four 
models which do not enforce strong 3-consistency, while in the fifth model Bnjml^s] 
there is no such peak. However, we observed, on some additional trials not reported 
here, that the secondary peak also exists in ^^',^[£3] as we increase the problem size. 

3. The CNF Encoding Scheme. In our experiments, the CNF encoding of the CSP 
instances does not include the clauses that enforce a unique assignment to a CSP 
variable. While the absence of these clauses does not affect theoretical exponential 
lower bounds on the resolution complexity, we observed in our experiments mixed 
impacts of these clauses on the behavior of the solvers including the appearance of 
the secondary peak of ZChaff. When these clauses are included in the CNF encoding, 
the number of branches and the running time of SatZ both increase, while for ZChaff 
the running time has an obvious decrease. It is also interesting that adding these 
clauses to the CNF encoding also makes the secondary peak of ZChaff even sharper. 
We believe that this is largely due to the greedy branching heuristics in SatZ and the 
tradeoff between the search and the inference achieved in ZChaff. 

While most of the empirical studies on the phase transitions in the AI literature have 
focused on the behavior of the solvers around the solubility threshold of the random 
model, we would like to point out that all of the above observations are made at the 
ratio of constraints to variables below the respective models' threshold, a parameter region 
where the selection of the branch heuristics and the no-good recording technique make a 
big difference. 
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5. Conclusions and Future Directions 

Random or semi-random binary constraint instances have been used to test various aspects 
of CSP solvers by many researchers. To illustrate robustness of algorithms, it is desirable 
that the instances generated under a model not be trivially unsolvable and a minimal 
guarantee of interest is that the generators should produce instances that asymptotically 
have exponential resolution proofs. In this paper we have shown that if we ensure that 
the constraints are such that the instances are strongly 3-consistent or weakly 4-consistent 
then this will be the case. In addition we have shown how to create such instances, while 
allowing for a high constraint tightness, and allowing considerable flexibility in constraint 
design. In the experimental sections we showed a significant increase in difficulty of instances 
generated by our model over other models at similar densities and tightness. We also noticed 
a double peak phenomenon and after further experiments identified it as an artifact of the 
specific solver, but influenced by the choice of the generation model and the characteristics 
of the instances. This exemplifies exactly the kind of algorithmic issue that we hope such 
generators would help researchers identify and explore. 

The generation of hard instances has two foci, one in the restriction of the constraints 
and the other restricting the constraint graph (or hyper-graph). For specific problems, for 
example independent set (Brockington Sz Culberson, 1996) or SAT (Bayardo Jr. Sz Schrag, 
1996), it has been observed that techniques such as balancing the degree of the constraint 
graph to reduce variance increases the difficulty of instances, whether in camouflaging a 
hidden solution or at the phase transition of semi-random instances. We expect that modi- 
fying our generation model by also controlling the graph structure might lead to other harder 
instances, or other interesting properties that would test the mettle of various algorithms. 

For example, suppose we consider the domain D = {0,1,2,3} and let a constraint 
have the set of allowed value pairs {(0, 1), (0, 3), (1, 0), (3, 0), (1, 2), (2, 1), (2, 3), (3, 2)}. This 
constraint could not be generated by our system using the eight cycle that enforces weak 
4-consistency because this value set induces two four cycles on a pair of domains. Thus, 
this constraint will not generate weakly 4-consistent CSPs. On the other hand, it is arc 
consistent and does not contain a forcer, since each vertex in the induced constraint graph 
has degree two. If we consider a constraint graph with a triangle, then applying this 
constraint to all three edges will mean there is no satisfying assignment. Since in a random 
graph with m/n = c there is a positive (bounded) expected number of triangles, there is 
then a positive expectation that an instance generated allowing this constraint would be 
trivially unsatisfiable. Thus, it appears that weak 4-consistency is a minimal requirement 
for exponential complexity on random constraint graphs. Notice that weak 4-consistency 
not only ensures that every triangle is satisfiablc, but it also ensures (by induction basically) 
that any larger cycle is satisfiable. Thus, speaking in general terms, the only way that a 
sub-instance can be unsatisfiable is to have the subgraph contain more edges than vertices. 
But random graph analysis shows this means asymptotically the minimal unsatisfiable sub- 
instance is of size 0{n), and this is of course a key ingredient of the complexity analysis. 

Now, suppose we ensure that the constraint graph has girth g > 3. Again this tech- 
nique has been used before on specific problems, for example on graph coloring problems 
(Culberson, Beacham, Sz Papp, 1995). We wonder whether or not combining such a graph 
restriction together with a weak g + 1-consistency (weaker than weak 4-consistency) might 
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also produce instances with exponentially long resolution proofs. Note that one difficulty 
in such an analysis is that wc no longer have uniformly random graphs. 

Once we start considering the larger picture, involving multiple variables, we naturally 
must consider /c-ary constraints. As part of future research we expect to consider extending 
our model to such cases. Of course some of the experiments on the effects of increasing 
tightness presented in this paper are on 3-ary constraints (SAT). In fact, our initial foray 
into tightness started with variations of SAT. 

As a final cautionary note, we point out that it is well known that it is possible for 
CSP (i.e. SAT) instances to have exponential resolution proofs, while being resolvable in 
polynomial time by other techniques, such as Gaussian elimination. We are not certain our 
system can produce such instances, but see no explicit reason that prevents it from doing 
so. 

Acknowledgments 

A preliminary version of this paper appeared in the Proceedings of the Tenth International 
Conference on Principles and Practice of Constraint Programming (CP-2004). We thank 
the referees for their helpful comments on the conference version and the journal version. 
Thanks are also given to Dr. K. Xu for his comments on the conference version of the paper. 
This research has been supported by Natural Sciences and Engineering Research Council 
Grant No. OGP8053. Yong Gao is supported in part by a UBCO startup grant and an 
NSERC Discovery Grant RGPIN 327587-06. 

Appendix A: Proofs of the Theorems 

In this section, wc present more concepts related to the resolution complexity results stated 
in this paper and prove Theorems 2.1, 3.1, and 3.2. 

A.l Proof of Theorem 2.1 

Before providing the proof for Theorem 2.1, let us first formalize some definitions such as a 
forcer, a forbidding cycle, and an r-flower. 

Definition A.l (Forcers, Molloy & Salavatipour, 2003). A constraint Cf with var(C/) = 
{xi,X2} is called an {a, f3)-Jorcer if its nogood set is 

{(a, 7); V7 7^/3}, 

where a,f5, and 7 are domain values of the involved variables. We say that a constraint C 
contains an {a, Pi) -forcer Cf defined on the same set of variables as C if the nogood set of 
Cf is a subset of the nogood set of C. 

Definition A. 2 (Forbidding cycles and r-fiowers, Molloy &; Salavatipour, 2003). An a- 
forbidding cycle for a variable xq is a set of constraints 

Ci(xo,Xi),C2ixi,X2), . . . ,Cr-liXr-2,Xr-l), and Cr{Xr-l,Xo) 
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such that Ci{xo,xi) is an {a, ai) -forcer, Cr{xr-i,xo) is an {or-i, Ur) -forcer (a^ / ot), and 
Cj(.Tj_i, .Tj) is an {ai-i,ai) -forcer {2 < i < r — 1). We call xq the center variable of the 
a-forbidding cycle. 

An r -flower R = {Ci, ■ ■ ■ ,0^} consists of d (the domain size) forbidding cycles each of 
which has the length r such that 

1. Ci,l < i < d, have the same center variable x; 

2. each Ci is a distinct ai-forbidding cycle of the common center variable x; and 

3. these forbidding cycles do not share any other variables. 
The following facts are straightforward to establish: 

1. An r-flower consists of s = d{r — l)-\-l = dr — d-\-l variables and dr constraints; 

2. The total number of r- flowers is 

3. A constraint in contains an (a, /3)-forcer only if the pair (a, /?) is one of the 
tuples that are not considered when selecting the set of nogoods of the constraint. 

In the following, we assume that r = o{^/n). The probability that a constraint contains 
a forcer and the probability that a random instance of the flawless random CSP model 
contains an r-flower are given in the following lemma. 

/<fl-d-d+l\ 

Lemma A.l. Consider the flawless random CSP Bn^rn[M.] and define fe = ' . 

{ t ) 

1. The probability that a given constraint C{xi,X2) contains an {a, l3)-forcer is 




(A.l) 



2. Let R be an r-flower and let c = m/n, 

P{R appears in Bf:^^[M]} = e(l)(2c/e)*^^. (A.2) 

Proof. Equation (A.l) follows from the following two observations: 

1. ^ is the probability that (a,/?) is one of the tuples that are not considered when 
selecting the set of nogoods of a constraint in Bn,m[A^] and 

2. fe is the probability that the d — 1 tuples, (a, 7), 7 7^ /?, are in the set of t nogoods 
selected uniformly at random from (P — d tuples. 
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To calculate the probability that a given r-flower R appears in H^',m[A^], notice that the 
probability of selecting all the constraint edges in R is 

iZ-tr) ^ crj-jcn - 1) • • • (cn - dr + 1) 
Q NiN-l)---iN-dr + l) 

= m 



2^ * 



n 

where N = (2). Since for each fixed choice of dr constraint edges in the r-flower, the 
probability for these constraints to contain the r-fiower is {\feY^^ Equation (A. 2) follows. 

□ 

Proof of Theorem 2.1. Let c* = i^-^. We will show that if c = ^ > c*, then 

Urn contains an r-flower } = 1. (A.3) 

n— »oo ' 

Let Ir be the indicator function of the event that the r-fiower R appears in ;Bn,m[A^] and 
let 

R 

where the sum is over all the possible r-flowers. Then, -Sn,m[.A4] contains an r-flower if and 
only if X > 0. 

By Lemma A.l and the fact that s = dr — d + 1, we have 



E[X] = ^E[Ir] 
R 

= G(l)Q.!(d-l)V(-^)(2c/e)^'^^^ 



r 

= e(l)n(n - 1) ... (n - 5 + l)d*(2c/e)'^'^^^ 

Therefore, if c > c* and r = Alogn with A sufficiently large, wc have lim E[X] = 00. 

If WC can show that < -|- o(l)), then an application of the Chebyshev 

inequality will establish that lim P{X = 0} = 0. To get an upper bound on we 

n— >oo 

need a counting argument to upper bound the number of r-flowers sharing a given number 
of edges. This is done by considering how the shared edges form connected components 
(Franco & Gelder, 2003; Gao & Culberson, 2003; Molloy & Salavatipour, 2003). Here, we 
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follow the method used by Molloy and Salavatipour (2003), from which we have 



A A B:BnA=0 A \i=l j=l 

A A B:BnA=0 A \i=l j=l 

< £;2[x] + 5;/^[^^Ar,,.(p,,.)*-M (A.4) 

A \i=l j=l J 

where (1) Nij is the number of the r- flowers that share exactly i constraint edges with A 
and these i constraints forms j connected components in the constraint graph of A; and 
(2) (Pij)'^^~^ is the probability conditional on Ia, that the random CSP contains the dr — i 
constraints of a specific r-flower. In the work of Molloy and Salavatipour (2003), Nij is 
upper bounded by 

((2 + r'^f{dry-^y j\n'-'-^d'-'-^+'^-^, 

where {{2 + r'^)'^{dr'^y~^)'^jl upper bounds the number of ways to choose and arrange the j 
shared connected components for two r-flowers; n*~*~-' upper bounds the number of ways 
of choosing the remaining non-shared variables since the number of variables in each of 
the j shared connected components is at least one plus the number of edges in that shared 
component; and d^-^-J+d-'i- upper bounds the number of ways of choosing the forcing values 
in these non-sharing variables. The shared variables have to take the same forcing values 
as those in A due to the assumption that t < d (Molloy & Salavatipour, 2003). 

Since in our case d—l<t<d^ — d,itis possible for the shared variables to take different 
forcing values in different r-fiowers. Thus, an upper bound for N^j is 

((2 + r'^f{dry-^yj\n'-'-^d'. 

But in our case, the probability corresponding to (PijY^~^ is 

(N—dr-{dr-i)\ 

\ cn-i-(dr-i) I ^ 1 j, ^dr-i _ ~ \dr-i(^ f \dr-i 

(N-dr^ ^rf/ej -^^^)^N-dr^ ^d^'' 
\ cn—i I 

= e(i)(2c/e)^-^^. 
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Therefore, with c* = we have 



X] X] -^^J (^'^•^ s)'''' ' „dr-i ^dr- 
i=l j=l 



i=l 



(2 + r^)^''r-%-V^(2c/e)'^^-^^;^ ^( 



1 ^V-,^y 



i=l 

i=l •'^ 

where the last inequahty is because c > gj-- Prom this and formula (A. 4), the proof is 
completed. □ 

Remark A.l. The relatively loose upper bound c* = in the above proof may be improved 

by a factor of d by making a further distinction among the r-flowers that share forcing 
values at a different number of shared variables. But for the purpose of showing that the 
flawless random CSP also has potential embedded easy sub-problems, our upper bound for 
the constraint-variable ratio c is sufficient since the domain size d is a constant. 

A.2 Proof of Theorems 3.1 and 3.2 

Given a CNF formula J^, we use Res(.7-') to denote the resolution complexity of J^, i.e., the 
length of the shortest resolution refutation of J^. The width of deriving a clause A from J^, 
denoted hy w{J^ \- A), is defined to be the minimum over all the resolution refutations of 
the maximum clause size in the resolution refutation. The width w{T) of a formula is the 
size of the largest clause in it. Ben-Sasson and Wigderson (2001) established a relationship 
between Res(J^) and w(jF h 0): 

This relationship indicates that to give an exponential lower bound on the resolution com- 
plexity, it is sufficient to show that every resolution refutation of T contains a clause whose 
size is linear in n, the number of variables. 

Let I be an instance of the CSP and let CNP(J) be the CNP encoding of I. Mitchell 
(2002b) provided a framework within which one can investigate the resolution complexity of 
I, i.e., the resolution complexity of the CNP formula CNP(X) that encodes I, by working 
directly on the structural properties of X. Denote by var(X) the set of CSP variables in 
T and var(CNP(X)) the set of encoding Boolean variables in CNP(X). A sub-instance JT" 
of X is a CSP instance such that var(J') C var(X) and contains all the constraints of X 
whose scope variables are in var(j7). The following crucial concepts make it possible to work 
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directly on the structural properties of the CSP instance when investigating the resolution 
complexity of the encoding CNF formula. 

Definition A. 3 (Implies, Mitchell, 2002b). Let C be a clause over the encoding Boolean 
variables in var(CNF(X)). We say that a sub-instance J of I implies C , denoted as J |= C , 
if and only if for each assignment to the CSP variables satisfying J , the corresponding 
assignment to the encoding Boolean variables satisfies C. 

Definition A. 4 (Clause Complexity, Mitchell, 2002b). Let I be a CSP instance. For each 
clause C defined over the Boolean variables in var(CNF(X)), define 

li{C,Z) = min{|var(J')|; ^ is a sub-instance and implies C}. 

The following two concepts slightly generalize those used by Mitchell (2002b) and by 
Molloy and Salavatipour (2003) and enable us to have a uniform treatment when establishing 
resolution complexity lower bounds. 

Definition A. 5 (Boundary). The boundary B{J) of a sub-instance J is defined to be the 
set of CSP variables such that a variable x is in B{J) if and only if the following is true: 
If J' minimally implies a clause C defined on some Boolean variables in var(CNF(T)), then 
C contains at least one of the Boolean variables, x : a,a E D, that encode the CSP variable 

X. 

Definition A. 6 (Sub-critical Expansion, Mitchell, 2002b). Let I be a CSP instance. The 
sub-critical expansion of I is defined as 

e(I) = max min \B(J)\ (A.5) 

O<s<m{0,X) s/2<|var(J')|<s 

where the minimum is taken over all the sub-instances ofZ such that s/2 < \va,T{J')\ < s. 

The following theorem relates the resolution complexity of the CNF encoding of a CSP 
instance to the sub-critical expansion of the CSP instance. 

Theorem A. 2 (Mitchell, 2002b). For any CSP instance I, we have 

w{CNF(I) h 0) > e(J) (A.6) 

Proof. For any resolution refutation tt of CNF(X) and s < )u(0,X), Lemma 1 of Mitchell 
(2002b) shows that tt must contain a clause C with 

s/2 < n{C,l) < s. 

Let JT" be a sub-instance such that |var(j7')| = ii(C,I) and J' implies C. Since JT" minimally 
implies C, according to the definition of the boundary, w(C) > |;S(J')|. Formula (A.6) 
follows. □ 

To establish an asymptotically exponential lower bound on Res(C) of a random CSP C, 
it is enough to show that there is a constant /?* > such that 

lim P{e{C) > /3*n} = 1. (A.7) 
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For any a > 0, let Am{cx) be the event {/i(0,C) > an} and As{a,/3*) be the event 

I min B(J) >p*n\. 

[fl<|var(:r)|<an J 

Notice that 

P{e{C)>(3*n} > P{Am{a)r]As{a,p*)} 



> l-P{Amia)}-P{Asia,(3*)}. (A.8) 

We only need to find appropriate a* and fj* such that 



lim P{Am{a*)} = (A.9) 

n—*oo 

and 

lim P{As{a*,P*)} = 0. (A.IO) 

Event Am{(x*) is about the size of minimally unsatisfiable sub-instances. For the event 
As{(X*,l3*), a common practice is to identify a special subset of the boundaries and show 

that this subset is large. For different random CSP models and under different assumptions 
on the model parameters, there are different ways to achieve this. Following Beame et al. 
(2005), we say a graph G is (r, g)-dense if there is a subset of r vertices that induces at least 
q edges of G. 

Proof of Theorem 3.1. Recall that the constraint graph of Bnjm[SC] is the standard 
random graph G{n,m). 

Since each instance of Bnfm[SC] is strongly fc-consistent, variables in a minimal unsatis- 
fiable sub-instance J with jvar(i7)| = r must have a vertex degree greater than or equal to 
k, and consequently, the constraint sub-graph H{J') must contain at least ^ edges. Thus, 

P{Am(a*)} = P{fi{$,BifJSC])<a*n} 

m) is (r, rfe/2)-dense } > . 
r=k+l J 

Let B^{J) be the set of the variables in var(^) whose vertex degrees are less than k. Again, 
since instances of Bi^rn[SC] are always strongly k-consistent, we have B^{J) C B{J) and 
thus, \B{J)\ > \B''{J)\. Therefore, the probability P{As{a* , l3*)} can be bounded by 



P{As{a*,P*)}<P{Alia*,f5*)} 
where As{a*,f3*) is the event 



min B^iJ) > P*n 

a*n/2<\vax{J)\<a*n 
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Random graph arguments (see, e.g., Beame et al., 2005) show that there exist constants 
a* and P* such that P{Am{a*} and P{A'^{a*, /?*)} both tend to 0. Indeed, let P* be such 
that > 1, c = ^, and AT = We have 

P{Am{a*)} <P\ (j {G{n, m) is (r, rA;/2)-dense } I 



rk^ 



< ^ P{G(n,m) is (r, —)-dense} 



r=k+l 
a*n / \ /r(r— 1) 



r=fc+l 



n 



rk 



2 



^ ,en,„ ,e(r — 1) , rfc 2c, rfc 

r=k+l 



= E 

r=fc+l '- 

= E 

r=fc+l 
[lognj 
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Similarly, we have for /3 



2^ 

a* ' 



(A.ll) 



P{A'l{a*,/3*)} = P<. IJ {3 a size-r sub-instance J s.t. \B''{J)\ < /3*n} 

ct*n _ -^N^ 

1^ {G(n, m) is (r, )-dense} 



< P < 



s E 



2c 



(l-/9)fc 

2 (l-/3)fe+2 (l-;3)fe-2 

e 2 (a*) 2 



(A.12) 



where the second inequality is because of the fact that for a sub-instance J with size r and 
IB'^iJ)] < P*n, its constraint graph contains at least r — f3*n = r — \(3n >r — (3r vertices 
whose degree is at least k. 
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There exist a* and (3* be such that (1) ^ < 1; (2) f > 1; and (3) the right hand 
side of formula (A. 11) and the right hand side of formula (A. 12) both tend to zero. This 
completes the proof of Theorem 3.1. □ 

We now prove Theorem 3.2. First from the definition of Bi^m [WC] , we have the following 

Lemma A. 3. For the random CSP Bn%i\WC], we have 

1. Every sub-instance whose constraint graph is a cycle is satisfiable; 

2. for any path of length > 3, any compatible assignments to the two variables at the 
ends of the path can be extended to assignments that satisfy the whole path. 

In an effort to establish exponential lower bounds on the resolution complexity for a 
classical random CSP models with a tightness higher than those established by Mitchell 
(2002b), Molloy and Salavatipour (2003) introduced a collection of sub-instances, denoted 
here as Bm{J)-, and used its size to give a lower bound on the size of the boundary. For 
binary CSPs whose constraints are arc-consistent and contain no forcer, Bm{<J) consists of 
two parts: B\^{J) and B\^{J), defined respectively as follows: 

1. B\^{J) contains the set of single-edge sub-instances X, i.e., |var(A:')| = 2, such that 
at least one of the variables has a vertex degree 1 in the original constraint graph; 

2. B\^{J) contains the set of sub-instances X whose induced constraint graph is a pen- 
dant path of length 4, i.e., a path of length 4 such that no vertex other than the 
endpoints has a vertex degree greater than 2 in the original constraint graph. 

It can be shown that 

Lemma A.4. For any weakly path-consistent CSP sub-instance J , we have 

mj)\>\BUj)\ + ^-^^^- 

Proof. The variable with degree one in any sub-instance in B\j{J) has to be in B{J); At 
least one internal variable in any pendant path B\j{J) has to be in B{J). It is possible 
that several pendant paths of length 4 share a common internal variable that is in B{J), 
e.g., in a very long pendant path. But a variable can only appear in at most three pendant 
paths of length 4. □ 

With the above preparations, the proof provided for Theorem 1 of Molloy and Salavatipour 
(2003) readily applies to our case. To make this report self-contained, we give the proof 
below. 

Proof of Theorem 3.2. By Lemma A.3, any minimally unsatisfiable sub-instance J is 
such that (1) its constraint graph cannot be a single cycle; and (2) Bm{J) is empty since 

\B\i{J)\ = and \B\,i{J)\ = for a minimally unsatisfiable sub-instance. According to 
Lemma 11 of Molloy and Salavatipour, the constraint graph of J has at least j^)y&i{J) 
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edges. Therefore, due to the locally sparse property of random graphs, there is a constant 
a* > such that formula (A.9) holds, i.e., 

Hm P{Am{a*)} = 0. 

n— >oo 

To establish formula (A. 10), due to Lemma A. 4 we have 

P{Asia*,P*)}>P{AsM(^*^P*)} 
where Ag^uiot* , P*) is the event 

I min \BMiJ)\ > P*n] . 

[a'-n/2<\va.r{J)\<a*n J 

Now suppose on the contrary that for any C > 0, there is a sub-instance J' with a*n/2 < 
|var(J)| < a* such that \l3ljlj)\ + \Sm{J)\ < C"- Then, from Lemmas 10 and 11 of Molloy 
and Salavatipour, the constraint graph of contains only cycle components, where Lemma 
11 of Molloy and Salavatipour asserts that the edges-to- vertices ratio of the constraint graph 
of J' has to be bigger than one. If we remove all the cycle components from the constraint 
graph of J', the edges-to-vertices ratio of the remaining graph becomes even bigger. But 
this is impossible because the constraint graph of J', and hence the remaining graph, has 
less than a*n vertices. 

It is well-known that w.h.p. a random graph has fewer than logn cycle components 
of length at most 4; for the random graph G{m,n) with m/n = c being constant, the 
number of cycle components with a fixed length has asymptotically a Poisson distribution 
(Bollobas, 2001). Thus, the number of variables that are in cycle components of length 
4 is at most 4 log n. Since any cycle component of length Z > 4 contains I pendant paths 
of length 4, the total number of variables in cycle components of length greater than 4 is 
at most \B'j^{J')\ < C,n. Therefore, we have var(j7') < Cn-|-41ogn < a*n/2 < var(j7) for 
sufficiently small (, a contradiction. 

We, therefore, conclude that there is a /3* such that w.h.p, for any sub-instance J7 with 
a*n/2 < |var(J)| < a*, \Bm{J)\ > P*n, i.e., formula (A.IO) holds. □ 

A. 3 Upper Bound on the Threshold of the Random Restricted Binary CSP 

Bn\m [^] 

In this subsection, we show that the condition mentioned at the end of Definition 2.1 
guarantees the existence of a constant c* such that 6n.m[/^] with m/n > c* is asymptotically 
unsatisfiable with probability one. We state and prove the result for the case of binary 
constraints, but similar results also hold for the more general A;-ary constraints. 

Recall that the condition on the set of relations JC = {Li, L2, ... \ Li C D x D} is 

(a,a) ^ P|Li,Va G L>. (A.13) 
i>i 

Since we assume that the domain size is fixed, for any a & D the probability that a constraint 
in Hn',m['C] contains (a, a) as one of its nogoods is lower bounded by a constant p. 
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Theorem A. 5. If Bnfm[^] is such that the set of relations L satisfies the condition A. 13, 
then it is unsatisfiable w.h.p. if ^ = n is such that 

d(i-|r<i. 

Proof. For any assignment A, there must be a subset -S* of ^ variables that are assigned a 
common domain value a. Then, A satisfies 6n.m[-^^] only if A satisfies all the constraints 
that only involve the variables in S. Let be the event that Bnjm[j^ has i constraints that 
only involve the variables in S. Then, we have 

cn 

P{A satisfies B^f„[>C]} < - pyP{Hi}. 

i=0 

Let N = in(n- 1). Write 

for the number of possible edges induced by S, and 

N2 = N-Ni = ^{l- ^)n{n - 1) + 0(n). 
We have for any 1 < i < cn — 1 

(Ni\ / N2 \ 
\ i ) \cn—i) 



P{Hi} = 



\cnj 



< 



< 



Ni{Ni - 1) • • • (iVi - i + 1) * N2{N2 - 1) • • • (iV2 - cn + i + 1) (cn)! 

N{N-1)---{N -cn + 1) i\{cn-i)\ 



N I \N -ij \ i 

NiY f N2 fen 



N I \N -cn 
Similarly, we have 



^{^0} = ^ < 

\cn) 



N -cn 



and 



P{Hcn} - ."^^ - \ N 
\cn) 
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So, 



Therefore, 



P{A satisfies B^fM} < J^i^-pYPiHi} 

cn / nr \ i / AT at \ cn—i / 



N N -cn 



is satisfiable } < (l - = (d{l - 



□ 
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